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ABSTRACT Human coronavirus HKU1 (HCoV-HKUl) was first detected in a patient 
with viral pneumonia from Hong Kong in 2004. Here, we report the first complete 
genome sequence of HCoV-HKUl from Thailand, obtained from a nonill person who 
worked in a bat cave. Phylogenetic tree analysis revealed it as a group B HCoV- 
HKUl. 


T here are four species of endemic human coronavirus (HCoV) currently recognized 
by the International Committee for the Taxonomy of Viruses, namely, HCoV-OC43, 
-229E, -NL63, and -HKU1, and two epidemic CoVs, including severe acute respiratory 
syndrome (SARS)-CoV and the Middle East respiratory syndrome (MERS)-CoV discov¬ 
ered in 2003 and 2012, respectively (1). Bats are believed to be the ancestral hosts of 
alpha- and beta-CoV, including SARS-CoV and MERS-CoV (2, 3). A group C betacoro- 
navirus (MERS-related CoV) was detected from dry bat guano collected from a cave in 
Thailand where bat guano is sold for use as fertilizer (4). Nasopharyngeal swabs of bat 
guano miners (n = 34) in Ratchaburi Province, Thailand, were collected and sent to the 
Thai Red Cross-Emerging Infectious Diseases laboratory to test for CoVs using conven¬ 
tional PCR targeting the betacoronavirus RNA-dependent RNA polymerase (RdRp) gene 
(5). One sample was positive for the coronavirus HKU1 strain, and its whole genome 
was sequenced by using next-generation sequencing (NGS). 

The nasopharyngeal swab found CoV positive by conventional PCR was subjected to 
metagenomic sequencing on the lllumina MiSeq platform. RNA was extracted using a 
QIAamp viral RNA mini kit (Qiagen, Germany), followed by DNase treatment. A DNA 
library was prepared using a TruSeq total RNA with Ribo-zero globin kit (lllumina). The 
quality and quantity of the DNA library was estimated by using the QIAxcel Advanced 
system and QIASeq library quantification kit, respectively (Qiagen, Germany). The 17 
pmol of DNA library was injected into the flow cell and sequenced using a MiSeq 
reagent kit version 3. 

A total of 59,704,490 sequencing reads from 2 X 150-bp paired ends which, passing 
a quality score of 30, was used for data analysis. Host DNA (human genome from GATK 
resource bundle) removal was done using the SNAP (version 0.15.4) alignment tool (6). 
Mapping assembly was performed using HKU1 (GenBank accession number AY884001) 
as the reference strain by using the Burrows-Wheeler Aligner (BWA) program (7). A total 
of 6,475,827 reads mapped to HCoV-HKUl at a depth of 29,000X, which covered the 
whole genome of 29,811 bp. The consensus sequence of HCoV-HKUl with a G+C 
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FIG 1 Phylogenetic tree of HCoV-HKUl using maximum likelihood method with 1,000 bootstrap replicates. The whole-genome sequence of HCoV-HKUl 
(29,811 bp) from Thailand is colored in blue; it belonged to group B genotype and was closely related to HCoV-HKUl from Hong Kong (GenBank accession 
number AY884001). 


content of 65% was generated using SAMtools mpileup (8). This is the first complete 
genome of HCoV-HKUl isolated from Thailand. 

Phylogenetic analysis of our HCoV-HKUl whole-genome and reference sequences 
from the GenBank database, using maximum likelihood algorithm on the MEGA6 
program, showed that our sequence is in the same clade with sequences from China 
and the United States (Fig. 1). It belonged to the group B genotype and was closely 
related to HCoV-HKUl from Hong Kong (GenBank accession number AY884001) that 
was isolated in 2006 (99% nucleotide identity) (9). It is worth noting that this virus was 
detected from an individual with a high level of occupational exposure to bat feces and, 
thus, an elevated level of zoonotic virus spillover. While this individual was likely 
exposed to HCoV-HKUl due to person-to-person transmission and not via exposure 
from bats, our surveillance strategy and viral characterization pipeline provide valuable 
insight into the circulation of endemic infectious diseases in Thailand and increase the 
country's preparedness for other novel emerging infectious diseases. 

Data availability. The human coronavirus HKU1 strain reported here was depos¬ 
ited in GenBank under the accession number MH940245 and SRA accession number 
PRJNA509533. 
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